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Abstract 

The  problem  of  unification  of  terms  is  log-space  complete  for  P.  In  deriving  this  lower  bound  no 
use  is  made  of  the  potentially  concise  representation  of  terms  by  directed  acyclic  graphs.  In 
addition,  the  problem  remains  complete  even  if  infinite  substitutions  arc  allowed.  A  consequence  of 
this  result  is  that  parallelism  cannot  significantly  improve  on  the  best  sequential  solutions  for 
unification.  The  ^duar^problcm  of  computing  the  congruence  closure  of  an  equivalence  relation  is 
also  log-space  complete  for  P.  However,  we  shew  that  for  the  problem  of  term  matching,  an 
important  subcase  of  unification,  there  is  a  good  parallel  algorithm  using  0(logun]  time  and 
processors  on  a  PRAM.  For  the  0(logt  n)  parallel  time  upper  bound  wc-assume'that  the  terms  are 
represented  by  directed  acyclic  graphs;  if  the  longer  string  representation  is  used'~wTobtam' an 
0(log  n)  parallel  time  bound^  i’j  1 


1.  Introduction 


Unification  is  an  important  step  in  resolution  theorem  proving  [R]  with  applications  to  a  variety 
of  symbolic  computation  problems.  In  particular,  unification  is  used  in  PROLOG  interpreters  [CM], 
type  inference  algorithms  [M],  and  term  rewriting  systems  [GKM].  Many  symbol  manipulation 
problems  are  inherently  difficult  and  thus  do  not  have  efficient  solutions.  Theorem  proven  and 
PROLOG  interpreten  do  not  always  give  us  the  answen  we  want  fast  enough.  One  way  to  combat 
the  difficulty  of  these  problems  is  by  coordinating  many  processors  to  solve  a  single  problem 
instance  by  working  on  several  subproblems  in  parallel.  Although  there  are  a  number  of  ways  to 
introduce  parallelism  into  interpreters  [S]  and  theorem  proven,  unification  is  a  prime  target  since  it 
is  the  most  commonly  repeated  operation  in  these  tasks.  However,  our  analysis  suggests  that  parallel 
unification  algorithms  will  not  perform  significantly  faster  than  the  best  sequential  algorithms  known 
(e.g.,  [PW]  runs  in  linear  time).  We  show  that,  unless  P£NC,  an  untikely  twist  of  complexity  theory 
[C],  no  parallel  algorithm  for  unification  will  run  in  time  bounded  by  a  polynomial  in  the  logarithm 
of  the  input  size,  and  using  a  number  of  processors  bounded  by  a  polynomial  in  the  size  of  the 
input  We  use  the  PRAM  of  [FW]  as  our  model  of  parallel  computation,  although  we  could,  just  as 
well,  have  used  any  other  "reasonable  parallel  model"  [J]. 


*  Supported  by  a  Bantrell  Fellowship.  2Supported  partly  by  NSF  grant  MCS-1210830  and  partly  by  ONR-DARPA  grant 
N00014-S3-K-0146  ^Supported  by  an  IBM  Fellowship 
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Informally,  two  symbolic  terms  s  and  t  arc  unifiablc  if  there  is  some  way  of  substituting 
additional  terms  for  variables  in  s  and  t  so  that  both  become  the  same  term.  All  occurences  of  a 
variable  x  in  both  s  and  t  must  be  replaced  by  the  same  term.  For  example,  the  terms  f(x,  x)  and 
f(g(y),  g(g(z)))  may  be  unified  by  substituting  g(z)  for  y  and  g(g(z))  for  x.  A  unification  problem  like 
"unify  Htj,  tj)  and  fitj,  i^)"  may  be  decomposed  into  two  subproblcms  "unify  tj  and  tj"  and 
"unify  t2  and  14".  However,  these  two  problems  cannot  be  solved  entirely  separately  in  parallel.  If 
some  variable  x  occurs  in  both  tj  and  14.  for  example,  then  the  solutions  to  the  subproblcms  must 
be  coordinated  so  that  both  substitute  the  same  term  for  x. 

There  are  several  variations  of  the  unification  problem.  For  example,  a  type  inference  algorithm 
may  construct  labeled  graphs  which  represent  terms  that  must  be  unified.  An  acceptable  result  of 
unification,  in  this  case,  may  be  a  labeled  graph  with  a  cycle.  Labeled  graphs  with  cycles  represent 
types  defined  by  recursion  [MPS],  or,  if  interpreted  as  terms,  represent  "infinite  terms".  Thus  one 
natural,  unrestricted  version  of  unification  is  to  allow  "infinite  terms”  to  be  substituted  for  variables. 
Using  the  "infinite  term”  f(f(f...)),  we  can  unify  x  and  f(x),  something  we  could  not  do  otherwise. 
Unrestricted  unification  also  appears  in  many  PROLOG  interpreters;  those  omitting  the  occur  test 
[CM].  Another  variation  on  unification  is  the  special  ease  in  which  the  labeled  graphs  are  from  a 
class  of  tree-like  directed  acyclic  graphs  (which  we  call  simple  dags).  The  complexity  of  unification 
on  simple  dags  is  precisely  the  complexity  of  unification  on  symbolic  (string)  representations  of 
terms,  as  opposed  to  the  complexity  as  a  function  of  the  size  of  more  concise  graph  representations. 
For  this  case  it  was  known  that  unification,  without  "infinite  terms",  is  co-NLOGSPACE-hard  [LS]. 
This  did  not  exclude  the  possibility  of  parallel  algorithms,  moreover  no  lower  bound  was  known  for 
unrestricted  unification. 

We  show  that  all  of  the  above  variants  of  unification  are  log-space  complete  for  P  [C,  Gl,  G2], 
and  hence  unlikely  to  have  nice  parallel  solutions.  The  nondeierministic  log-space  test  for 
ununifiability  in  [LS],  which  could  have  led  to  a  Oflog2  n)  parallel  time  solution,  is  sufficient,  but 
unfortunately  not  necessary  (see  Figure  3b  for  a  counterexample  to  this  test).  In  addition,  we  show 
that  the  related  problem  of  congruence  closure  [DST]  is  complete  for  P. 

One  important  special  case  of  unification  can  be  solved  quickly  in  parallel.  This  problem  called 
term  matching,  arises  in  term  rewriting.  A  term  s  matches  a  term  t  if  t  is  a  substitution  instance  of  s. 
The  rewrite  rule  l-*r  may  be  used  to  rewrite  a  term  t  whenever  /  matches  t  [GKM].  We  show  that 
matching  can  be  accomplished  in  log2-time  on  a  PRAM,  using  a  polynomial  number  of  processors. 
Our  algorithm  combines  parallel  transitive  closure  of  a  directed  acyclic  graph,  with  parallel 
computation  of  connected  components  of  an  undirected  graph  [HCS,  Ch].  Also,  matching  is  in 
NLOGSPACE,  and  for  simple  dags  it  is  in  DLOGSPACE 

Following  the  definitions  presented  in  Section  2,  we  will  discuss  labeled  graph  unification  in 
Section  3,  unification  for  simple  dags  and  congruence  closure  in  Section  4,  and  term  matching  in 
Section  S. 
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2.  Definitions 
II  Terms  and  Dags 

Let  I'  be  an  infinite  set  of  variables  x,y,/,xj....  and  /•'  an  infinite  set  of  function  symbols 
fg.h,fj.„.  We  assume  that  I  and  /•'  arc  disjoint.  l-lach  function  symbol  f  has  a  fixed  arity,  a 
nonnegative  integer  a(0.  A  function  symbol  g €/*  with  a(g)=0  is  called  a  constant.  Ihc  set  T  of 
terns  is  defined  inductively  by: 

a  variable  x€T  or  constant  g €/■'  is  a  term,  and 

if  fi£/'  and  tj . t.^  arc  terms,  then  fitj _ t^^)  is  a  term. 

Terms  may  be  represented  using  directed  acyclic  graphs  with  labeled  nodes  and.  possibly, 
multiple  labelled  arcs.  A  labeled  directed  graph  is  a  finite  directed  graph  G.  such  that: 

(1)  every  node  v  of  G  has  a  unique  label,  denoted  labcl(v),  with  labcl(v)€  l\JF, 

(2)  for  each  x€  V.  there  is  at  most  one  node  v  with  labc!(v)=x.  and  it  has  outdcgrcc  0, 

(3)  if  a  node  v  has  label  f€F.  with  arity  a(f)>0.  then  it  has  outdcgrcc  a(f).  and 
the  arcs  leaving  it  are  labeled  l,2,«..a(0* 

If  there  is  an  ait  labeled  i  from  node  u  to  node  v,  then  we  say  that  v  is  the  i-th  son  of  u. 
A  labeled  dag  G  is  a  labeled  directed  acyclic  graph.  Hie  leaves  of  G  are  the  nodes  of  outdegree  0; 
note  that  a  node  v  is  a  leaf  iff  label(v)  is  either  a  variable  or  a  constant  The  height  of  a  node  v  of  a 
dag  G  is  the  length  of  the  longest  path  from  v  to  a  leaf.  A  root  of  a  dag  is  a  node  of  indegree  0. 

If  G  is  a  labeled  dag.  we  can  associate  a  term  ty  with  any  node  v  of  G.  We  say  that 
v  represents  ty.  The  term  ty  is  defined  by  induction  on  the  height  of  v: 

if  v  is  a  leaf,  then  ty=label(v), 

if  v  has  sons  vj,...,vk,  and  label(v)=f,  then  ty»l(ty  „»,ty  ). 

The  definition  of  labeled  dags  above  ensures  that  ty  is  always  a  well-formed  term.  If  G  is  a 
labeled  directed  graph,  then  we  can  associate  an  infinite  term  ty  with  each  node  v  of  G  by  a  similar 
definition.  Since  we  only  consider  finite  graphs,  all  terms  represented  by  nodes  of  a  labeled  graph  G 
are  finite  ifT  G  is  acyclic.  If  G  is  a  labeled  dag  with  only  one  root  r,  then  we  say  that  G  represents 
the  term  tp 

The  representation  of  terms  by  labeled  dags  is  illustrated  in  Figure  1.  The  terms  g(x)  and  x  are 
represented  by  the  two  nodes  of  the  labeled  dag  in  Figure  la.  Doth  roots  in  Figures  lb,  lc  represent 
f(frx,  x),  f(x,  x)).  The  terms  h(x,  x,  y,  z)  and  h(g(y),  g(g(z)),  gfgfg^),  g(g2))  are  represented  by  the 
roots  of  Figure  Id.  In  Figure  1,  we  assume  that  a(f)= 2.  a(h)=4,  a(g)=l,  and  a(gj)=a(g2)=0. 

Although  each  node  of  a  labeled  dag  determines  a  single  term,  the  converse  is  not  true.  A  term 
t  can  be  represented  by  several  different  dags.  In  particular,  if  t  is  a  term  with  several  occurrences 
of  a  subterm  ij,  then  we  may  use  a  separate  subdag  for  each  occurrence  of  tj  in  t,  or  use  one 
subdag  for  all  occurrences;  cf.  Figures  lb  and  lc.  Since  a  repeated  subterm  need  be  represented 
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only  once,  it  is  possible  to  represent  some  very  long  terms  with  relatively  small  labeled  dags,  l  or 
example,  llie  dag  in  l-'igurc  1c  with  n  nodes  represents  a  term  with  (X-n)  symbols.  We  define  a 
class  of  labeled  dags  which  arc  no  more  concise  that  terms. 

A  simple  Jag  is  a  labeled  dag  G  such  that  the  only  nodes  of  G  with  indcgrcc  greater  than  1  arc 
leaves.  ’Ihus  every  node  of  a  simple  dag  that  is  not  a  leaf  or  a  root  must  have  indcgrcc  1.  Given  a 
term  t  (in  the  form  of  a  string  of  symbols),  we  can  construct  a  simple  dag  representing  l  in  linear 
time,  using  only  logarithmic  space.  Similarly,  given  a  simple  dag  G  with  a  single  root,  we  can  write 
out  the  term  represented  by  G  in  linear  time  and  logarithmic  space.  Moreover,  the  si/.c  of  a  simple 
dag.  measured  in  number  of  nodes  and  arcs,  is  within  a  constant  multiplicative  factor  of  the  length 
of  the  term  it  represents. 


12  Unification  and  Term  Matching 

Unification  and  term  matching  arc  both  problems  that  are  solved  by  computing  substitutions.  A 
substitution  a  is  a  mapping  from  variables  to  terms  such  that  o(x)=x  for  all  but  finitely  many  x€F. 
The  action  of  a  substitution  a  on  a  term  t,  written  <r(t).  is  the  result  of  replacing  each  variable  x  in  t 

by  o(x).  Thus  e(fU^ . tj,))=fl>(t]),...,<r(tj,)).  In  particular,  any  substitution  o  maps  every  function 

symbol  to  itself.  We  use  =  to  denote  syntactic  equality  of  strings. 

Two  terms  s  and  t  are  unifiable  if  there  exists  a  substitution  a  such  that  a(s)=a(t).  A  term  s 
matches  term  t  if  there  exists  a  substitution  a  with  o(t)~L 

In  some  instances  we  may  wish  to  allow  substitutions  to  map  variables  to  infinite  terms.  If  we 
allow  these  more  general  substitutions,  then  we  have  the  unrestricted  unification  and  unrestricted 
matching  problems.  Unrestricted  unification  differs  from  unification  (e.g..  in  Figure  la  x  and  g(x) 
are  ununifiable  but  unrestricted  unifiable  with  <r(x)=g(g(...))  an  infinite  term).  Unrestricted 
matching  and  matching  arc  the  same;  note  that  we  only  consider  substitutions  that  involve  infinite 
terms,  not  unification  of  infinite  terms  s  and  L 

If  o(s)=a(t),  then  a  is  called  a  unifier  for  s  and  L  A  substitution  a  is  more  general  than  a 
substitution  r  if  there  exists  a  substitution  p  with  r=p*a.  In  [R],  it  is  shown  that  whenever  terms  s 
and  t  are  unifiable,  there  is  a  unifier  a  for  s  and  t,  which  is  more  general  than  any  other  unifier. 
This  is  called  the  most  general  unifier  (mgu)  for  s  and  t  The  mgu  is  unique  up  to  renaming  of 
variables.  For  example,  consider  the  terms  s=f(x,  y)  and  t=fig(y).  g(z))  represented  in  Figure  If. 
These  terms  are  unifiable,  with  mgu  <r(x)=g(g(z)),  <r(y)=g(z),  and  a(z)=z;  then 
o(s)=o(t)=f(g(g(z)),  g(z)). 

Two  terms  s  and  t  arc  unifiable  if  a  certain  kind  of  relation,  can  be  constructed  on  the  nodes  of 
a  labeled  dag  representing  s  and  t.  If  u  and  v  are  two  nodes  of  a  labeled  dag  and  if  Uj  is  the  i-th 
son  of  u  and  vj  the  i-th  son  of  v,  for  some  i,  then  Uj.Vj  arc  corresponding  sons  of  u.v. 
A  relation  R  on  the  nodes  of  a  labeled  dag  is  a  correspondence  relation  if,  for  all  u,v,Uj,vj: 

uRv  -*  ujRvj  whenever  uj.vj  are  corresponding  sons  of  u,v. 
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A  correspondence  relation  that  is  also  an  equivalence  relation  will  be  called  a  t-c  relation. 
A  relation  K  is  homogeneous  if  labcKu)  and  labcl(v)  are  not  different  symbols  whenexer  uRv. 
An  equivalence  relation  R  on  nodes  of  a  labeled  directed  graph  G  is  acyclic  if  the  R-cquivalcnce 
classes  are  partially  ordered  by  the  arcs  of  G.  In  [PW],  acyclic,  homogeneous  c*c  relations  are  called 
valid  equivalence  relations.  These  relations  characterize  unifiability. 

Proposition  1:  (PW]  Let  u  and  v  be  nodes  in  a  labeled  dag  G.  Then  ty  and  ty  are  unrestricted 
unifiablc  iff  there  is  a  homogeneous  c-e  relation  R,  with  uRv.  Similarly,  ty  and  ty  arc  unifiablc  iff 
there  is  an  acyclic,  homogeneous  c-c  relation  R,  with  uRv.  □ 


If  R  is  an  acyclic,  homogeneous  c-c  relation  on  a  labeled  dag  G,  then  the  reduced  graph  formed 
by  treating  each  equivalence  class  as  a  single  node  is  again  a  labeled  dag.  If  u  and  v  arc  the  only 
two  roots  of  G,  and  uRv,  then  this  reduced  graph  with  a  single  root  represents  a  term  s  that  is  a 
substitution  instance  of  both  ty  and  ty.  If  R  is  the  minimal  c-e  relation  with  uRv,  then 
s=o(tu)=o(tv),  where  a  is  the  mgu  of  ty  and  ty  [PW].  We  can  extract  a  from  R  by  taking  a(x)  to 
be  the  term  in  the  reduced  graph  that  is  represented  by  the  node  formed  from  the  equivalence  class 
of  x.  We  can  therefore  consider  the  reduced  labeled  dag  as  a  reasonable  representation  of  a  unifier 
for  two  terms.  This  representation  of  a  unifier  has  the  virtue  of  being  compact;  it  is  clear  that  the 
reduced  graph  is  no  larger  than  the  original  dag.  However,  if  we  were  to  write. out  each  unifier 
explicitly,  we  might  end  up  writing  out  terms  that  were  much  longer  than  the  terms  represented  by 
the  input  dag.  An  example  in  [PW]  shows  that  the  length  of  the  substitution  may  be  an  exponential 
function  of  the  length  of  the  input  terms. 

As  in  [PW],  we  will  represent  equivalence  relations  on  the  nodes  of  labeled  dags  by  adding 
undirected  edges  to  the  labeled  dag  data-structure. 

Matching  may  be  viewed  as  a  special  case  of  unification.  Let  oc  be  a  substitution  such  that  for 
each  distinct  variable  x,  in  the  terms  we  are  examining,  ec(x)  is  c,,  a  distinct  constant  symbol  not 
appearing  in  these  terms.  It  is  easy  to  see  that  a  term  s  matches  a  term  t  iff  s  and  ac(t)  are 
unifiable.  Another,  degenerate  case  of  unification  is  to  determine  whether  two  terms  are  syntactically 
identical.  Of  course,  this  is  a  trivial  operation  on  strings,  but  it  is  not  quite  so  trivial  an  operation 
when  terms  are  represented  by  labeled  dags.  Gearly,  s  and  t  are  syntactically  equal  iff  oc(s)  and 
ac(t)  are  unifiable. 

In  summary,  using  the  labeled  dag  data  structure,  we  have  the  following  problems: 


UNIFY(G,u,v) 

Input:  A  labeled  dag  G  with  distinguished  nodes  u  and  v. 
Output:  Are  ty  and  ty  unifiablc? 

If  yes,  then  produce  a  labeled  dag  representing  the  mgu. 


MATCH(G,u,v):  'ITiis  is  UNlFY(G.u.v)  with  oc(ty)  instead  of  ty. 


EQUAL(G.u.v)):  This  is  UNlFY(G,u,v)  with  o^ty),  ®c(ty)  instead  of  ty.  ty. 

Of  course,  there  is  also  unrestricted  unification  UNIFY°°(G,u,v).  We  have  a  special  case  of 
each  of  the  above  problems  when  G  is  a  simple  dag. 


13  Parallelism,  NC  and  P 

For  sequential  computation  we  use  the  standard  definitions  for  time,  space,  space-bounded 
reductions  and  complexity  classes  such  as  P,  DLOGSPACE,  NLOGSPACE,  T(n)-DSPACE,  on  a 
Random  Access  Machine  (RAM)  [C].  We  denote  log-space  rcducibility  by  <j0g.  As  usual,  P  is  the 
class  of  languages  recognizable  in  deterministic  polynomial  time.  The  problems  UNIFY,  MATCH 
and  EQUAL  all  belong  to  P  [PW,  MM].  Some  may  be  solved  in  log^^n  space,  while  others,  those 
log-space  complete  for  P,  most  probably  cannot 

For  parallel  computation  we  use  the  Parallel  RAM  (PRAM)  of  [FW]  as  our  model,  with  parallel 
time  and  number  of  processors  as  the  critical  resources.  We  make  use  of  the  parallel  computation 
thesis,  relating  parallel  time  and  sequential  space,  and  its  proof  for  PRAM’S  [FW]: 

Uk>>o  logk(n)-parallel  time-PRAM  =  Uk^o  logk(n)-DSPACE. 

We  take  NC  to  be  the  class  of  problems  solvable  on  a  PRAM  using  log^^n  parallel  time,  and 
n®^  processors.  We  try  to  determine  whether  a  problem  in  P  is  "parallelizable"  (i.e.,  in  NQ  or 
"most  probably  not  parallelizable"  (i.e.,  log -space  complete  for  P);  [C,  J]  review  related  results. 

One  problem  that  is  log-space  complete  for  P  is  the  circuit  value  problem  for  monotone  circuits. 
A  monote  circuit  P  is  a  sequence  (pQ,..,Pn),  where  each  Px  is  either  an  input,  an  and-gate  AND(j.k), 
or  an  or-gatc  OR(j,k);  where  for  indices  j,k  we  have  i>j>k,  and  the  0,1  values  of  the  inputs  are  given 
explicitly.  In  addition,  monote  circuits  arc  assumed  to  have  the  following  properties: 

(1)  if  px  is  an  input,  then  the  index  i  appears  at  most  once  in  p,  (fan-out  ^1  for  inputs), 

(2)  if  Pi  is  a  gate,  then  the  index  i  appears  at  most  twice  in  p,  (fan-out  <2  for  gates), 

(3)  pn  is  an  or-gate  with  one  output. 

The  monotone  circuit  value  problem  is: 

MCV={/?|  p  is  a  monotone  circuit  with  the  output  value  of  pn  =0}. 

From  [31,  G2]  we  have: 

Proposition  2:  MCV  is  log-space  complete  for  P.  □ 


3.  Hu*  Complexity  of  ( tuificulion 

The  general  unification  problem,  encountered  in  theorem  proving  and  elsewhere,  is  to  find  a 
simultaneous  unifier  for  a  set  of  terms.  However,  the  general  ease  is  log-space  and  linear  time 
reducible  to  the  special  ease  of  unifying  a  single  pair  .of  terms  [PWJ.  On  a  PRAM  this  reduction  can 
be  performed  in  0(log  n)  parallel  time  and  with  0(n)  processors;  it  affects  none  of  our  results. 

We  first  describe  a  naive  unification  algorithm  based  on  the  criterion  of  Proposition  1.  and  on 
the  fact  that  the  mgu  is  the  minimal  c-c  relation  |PWJ.  The  input  to  the  algorithm  is  a  labeled  dag 
G  with  two  distinguished  nodes  u  and  v.  We  wish  to  solve  UNIFY(G.u.v).  A  relation  ®  is 
constructed  and  maintained  as  undirected  edges  in  G.  The  relation  ®  is  by  its  representation 
symmetric  and  reflexive.  In  order  to  make  ®  a  c-c  relation,  both  "correspondence”  and 
"equivalence”  must  be  satisfied.  Setting  sons  equivalent,  when  their  fathers  arc  equivalent,  is  known 
as  propagation.  For  ®  to  be  an  equivalence  relation  we  must  also  enforce  transitivity.  Having 
created  the  minimal  c*c  relation  ®  for  which  u®v.  we  then  test  for  homogeneity.  In  the 
affirmative  ease  a  new  labeled  graph  G'  can  be  constructed  by  coalescing  classes  of  nodes  in  G. 
Now  we  know  that  the  input  is  at  least  unrestricted  uniftablc.  If  G'  is  acyclic  it  is  unifiable. 

proc  naive-uni  fication(G,u,v) 

set  u®v  ; 

while  (®  is  not  a  c-e  relation)  do 

propagation:  while  (u®v  have  corresponding  sons  Uj.Vj  not  related  by  ®)  do  set  uj®Vj  od  ; 

transitivity:  while  (u®v  and  v®w,  but  u.w  are  not  related  by  ®)  do  set  u®w  od 

od  ; 

if  ®  not  homogeneous  then  print  UNUNIFIABLE 

else  {coalesce  equivalence  classes  to  produce  labeled  graph  G’} 
if  G’  has  a  cycle 

then  print  UNUNIFIABLE  BUT  UNRESTRICTED  UNIFIABLE 
else  print  UNIFIABLE 
fl 

fi 

proc  {G'  represents  mgu} 

In  this  algorithm  all  individual  steps  can  be  performed  on  a  PRAM  using  log^^n  time  and 
nO(l)  processors.  The  difficulty  arises  in  the  outer  loop,  the  body  of  which  is  executed  if  ®  is 
either  not  a  correspondence,  or  not  an  equivalence  relation,  i.e.,  if  either  condition  inside  an  inner 
loop  is  satisfied.  The  problem  is  that  on  an  input  of  size  n  the  body  of  the  main  loop  might  be 
executed  Q(n)  times.  This  behavior  is  illustrated  in  Figure  2.  The  example  can  easily  be  generalized 
to  force  the  Q(n)  alternation  between  propagation  and  transitivity  for  any  n. 


Theorem  I:  UNIFY(G.u.v)  and  UNIFY°°(G.u.v)  arc  log-sp.tcc  complete  for  P. 

Proof:  We  show  how  to  log-space  reduce  MCV  to  unifiahility  (for  membership  see  |PW)).  More 
spccilically.  if  a  is  a  monotone  circuit  {aQ.a].....an}.  we  construct  G(a),  u(a).  and  \(a)  such  that 
«€MCV  iff  UNIFY(G(«).u(oKv(o))=UNIFIABl.H. 

'litis  reduction  directly  applies  to  UNIFY00  and  is  easily  seen  to  use  only  log  space. 

The  monotone  circuit  a  can  be  represented  as  a  diagram  with  wires.  AND  and  OR  gates  of 
fan-in  2  and  fan-out  at  most  2.  a  special  OK  output  gate  with  one  output  wire,  and  with  each  input 
wire  leading  to  one  gate  and  hat  ing  a  0  or  a  1  value  (see  Figure  3a  for  an  example).  The  input  wire 
values  combine  to  produce  values  for  all  other  wires  and  the  output  wire  in  particular.  The  circuit 
has  no  feedback.  i.e..  if  the  wires  are  viewed  as  arcs  and  the  inputs  and  gates  as  nodes  we  get  a  dag 
without  multiple  arcs. 

(1)  Introduce  two  nodes  u(a).  v(a)  in  G(a). 

(2)  If  Oj  is  an  AND  gate  include  from  Figure  4a  in  G(a).  If  a;  is  an  OR  gate  include 
Gor  from  Figure  4b  in  G(a).  These  dags  have  two  pairs  of  input  nodes  and  one  pair  of  output 
nodes  each,  i.e.,  {INjj,  lN2i},  (INjj.  lN4i}.  and  {OUT^,  OUTjj}.  Corresponding  sons  are 
illustrated  by  the  labels  a,  b  on  the  arcs. 

(3)  If  a;  is  an  input  include  in  G (a)  a  pair  of  nodes  {OUTjj.  OUTjj}.  If  the  value  of  the  input 
is  1  then  make  OUTjj,  OUTjj  corresponding  sons  of  u(a).  v(a).  If  the  value  of  the  input  is  0  then 
make  OUTjj,  OUTjj  sons  of  u(a)  and  let  v(a)  have  two  sons  that  correspond  to  them  and  arc  two 
new  leaves  in  G(a). 

(4)  If  gate  oj  is  connected  to  aj.  (i.e„  in  the  wire  diagram)  then  identify  nodes 
IN u= OUTjj.  IN2j= OUTjj.  INjj=OUTj^f  IN4j= OUTjj..  When  these  subdags  are  concatenated 
nodes  have  outdegree  <2,  and  the  labels  on  the  ares  can  be  made  1  and  2,  so  that  the  equalities  of 
la^vis  a,  b  in  Figures  4a,  4b  is  preserved. 

(5)  In  the  dag  constucted  in  steps  1-4  above  assign  labels  to  the  nodes  as  follows: 
labelfu) = label(v) = h, 

label(node  of  outdegree  l)=g, 

labclfnode  of  outdegree  2)=f, 

label(OUTln)=gl*  g2=labeKOUT2n). 

labelfleaf  other  than  OUTjn,  OUT2n)= distinct  variable. 

We  can  easily  see  now  that  every  wire  w  in  the  wire  diagram  can  be  associated  to  a  pair  of 
nodes  OUT^  OUTw«.  We  require  u(a)®v(a).  For  such  a  minimal  c-e  relation  <9.  we  claim  that 
the  value  of  wire  w  in  a  is  1  iff  OUTw®  OUTw«.  This  certainly  holds  for  the  inputs,  because  of 
the  way  we  built  corresponding  sons  of  u  and  v.  Also,  it  is  trivial  to  check  that  G^  and  Gor 
simulate  the  behavior  of  AND  and  OR  gates.  Therefore  the  value  of  an  is  1  iff  OUTj„®OUT2n. 
The  graph  C\a)  u  constructed  in  such  a  way  that  the  only  place  homogeneity  could  be  violated  by 
®  is  if  OUTjn®OUT2n.  As  a  result,  if  a  si,  the  terms  represented  by  u(a)  and  v(a)  are  not 
unrestricted  uniflable,  and  if  a=0  they  are  unifiable  (the  acyclicity  condition  is  also  true).  □ 


4.  Simple  Dags  and  Congruence  Closure 

In  this  Section  we  will  make  our  lower  bounds  independent  of  the  potential}'  concise  dag 
representation  of  terms,  by  extending  them  to  simple  dags.  We  will  also  investigate  the  related 
problem  of  computing  the  congruence  closure  of  an  equivalence  relation. 

Theorem  2:  UNlFY(G.u.v)  and  UNIFY°°(G.u.v)  arc  log-space  complete  for  P.  even  when  G  is 
a  simple  dag. 

Proof:  Given  monotone  circuit  a  we  construct  a  simple  dag  G(a)  with  two  roots  u(a).  and  v(a) 
so  that,  if  <*„=0  then  the  terms  tu^aj,  t^j  arc  unifiablc  else  they  arc  not  unrestricted  unifiable. 
This  suffices  for  the  completeness  of  both  UNIFY  and  UNIFY00.  Note  that  the  proof  of 
Theorem  1  no  longer  applies,  because  the  G^.  dags  used  in  that  reduction  could  introduce  nodes 
with  indegree  2.  i.c..  their  output  nodes,  which  were  not  leaves. 

As  in  the  proof  of  Theorem  1,  we  encode  the  input  of  a  using  a  pair  of  nodes  for  each  circuit 
input.  The  input-subgraph  of  the  graph  of  Theorem  1  is  actually  a  simple  dag.  so  we  use  the  same 
construction.  However,  we  cannot  attach  "gates"  directly  to  the  input-subgraph  since  this  will 
produce  a  dag  which  is  not  simple.  Instead,  each  gate  will  be  constructed  separately  using  a  pair  of 
subgraphs.  Any  c-e  relation  ®  with  u(a)®v(a)  will  relate  the  two  parts  of  each  gate.  In  addition, 
the  input  nodes  of  one  gate  will  be  "connected"  to  input-subgraph  nodes  or  output  nodes  of  other 
gates  using  a  separate  "patch  board"  subgraph.  Recall  that  the  gates  of  a  are  numbered  so  that  if  an 
output  of  gate  aj  goes  to  an  input  of  gate  aj,  then  Kj. 

For  each  gate  of  o,  we  use  four  input  nodes  and  four  output  nodes.  For  gate  aj,  let  us  denote 
these  nodes  by  IN^,...,  IN^  and  OUTjj,...,  OUT^.  As  in  the  proof  of  Theorem  1,  the  nodes  of 
G(a)  work  in  pairs.  Inputs  INjj  and  IN^  represent  the  first  input  to  aj  and  INjj  and  IN4j  the 
second.  Similarly,  nodes  OUTjj  and  OUT^  represent  the  first  output  of  a-x  and  OUTjj  and  OUT^ 
the  second.  We  also  use  nodes  uj,  Vj  which  are  the  i-th  sons  of  roots  u(a)  and  v(a),  respectively, 
and  four  or  seven  internal  nodes  which  may  remain  anonymous. 

If  aj  is  an  OR-gate,  then  we  construct  a  simple  dag  GAl’Ej  as  in  Figure  Sa.  with  uj,  vj 
corresponding  sons  of  u(a),  v(a).  If  ®  is  a  c-e  relation  with  u(o)®v(a),  it  is  easy  to  see  that 
OUTjj®OUT2j  and  OUT3j®OUT4j  if  either  INjj®^,  or  IN3j®IN4|.  It  will  be  clear  from  the 
construction  of  G(a)  that  if  ®  is  minimal,  then  these  are  the  only  cases  in  which  the  output  nodes 
will  be  related  by  ®.  If  oj  is  an  AND-gate,  exactly  similar  reasoning  applies  for  the  simple  dag  of 
Figure  Sb,  which  simulates  the  logic  of  AND. 

The  remaining  task  is  to  "connect"  the  gates  so  that  if,  for  example,  the  first  output  of  aj  goes 
to  the  second  input  of  oj,  then  INjj®IN^|  whenever  OUT^OUT^.  We  use  an  example 
connection  between  aj  and  aj  to  illustrate  the  construction  of  a  "patch  board"  simple  dag  PATCH, 
which  contains  two  new  nodes  up,  vp  and  IN  and  OUT  nodes  from  the  input-subgraph  and  gate 
subgraphs  of  G(a).  Let  up,  vp  be  corresponding  sons  of  t>(a),  v(a),  different  from  the  sons  used  in 
the  gate  and  input  subgraphs.  Now  make  INjj  and  OUTjj  corresponding  sons  of  Up  and  vp;  also 


make  IN„|j  and  OUTjj  corresponding  sons  of  Up  and  \p  (see  figures  5c  and  5d).  When  u(o )®\(n). 
two  inpul  nodes  of  GATI-j  will  be  merged  if  the  right  two  output  nodes  of  GATEj  arc. 

As  in  die  proof  of  'nicorcm  1.  we  label  the  outputs  of  the  final  gate  with  different  constant 

symbols.  All  other  nodes  have  labels  that  depend  on  their  arity.  so  that  nodes  with  outdcgrcc  2,  say. 

have  the  same  label.  It  is  easy  to  verify  by  induction  that  in  the  minimal  c*c  relation  ®  with 

iKa)®v(a).  we  have  OUTjn®OUT2n  and  OUT3n®OUT4n  iff  the  output  of  the  last  gate  on  is  1. 

This  completes  the  proof  of  Theorem  2.D 
« 

Congruence  closure  is  a  practical  problem  that  is  in  many  ways  a  dual  to  unification.  In 
unification,  the  equivalence  classes  of  m  and  n  arc  merged  whenever  there  exist  some  equivalent  r 
and  s  such  that  m  and  n  arc  corresponding  sons  of  r  and  s.  In  congruence  closure,  the  equivalence 
classes  of  r  and  s  arc  merged  whenever,  for  all  pairs  of  corresponding  sons  m.  n.  we  have  that  m 
is  equivalent  to  n.  We  consider  a  pure  form  of  congruence  closure  in  which  the  node  labels  are 
ignored,  however,  the  arcs  must  still  be  labeled  so  that  we  can  see  which  sons  correspond.  Efficient 
algorithms  for  congruence  closure  arc  contained  in  [DST]. 

An  equivalence  relation  ©  on  the  nodes  of  a  labeled  graph  is  a  congruence  relation  when: 

if  u.v  have  same  outdcgrcc  and  for  each  pair  of  corresponding  sons  we  have  Uj©vj,  then  u©v. 
Given  any  equivalence  relation  R,  there  is  a  unique  minimal  congruence  relation  that  contains  R. 
called  the  congruence  closure  of  R.  An  equivalence  relation  R  can  be  represented  using  undirected 
edges  in  a  labeled  dag.  We  can  now  pose  the  following  language  recognition  problem: 
CONG={<G.u,v,R>|  nodes  u  and  v  of  labeled  dag  G  are  related  by  the  congruence  closure  of  R}. 

By  using  a  construction  that  resembles  that  of  Theorem  1  "turned  upside-down"  and  that 
exhibits  an  and/or  duality  between  unification  and  congruence  closure  we  can  show  that* 

Theorem  3:  The  language  CONG  is  complete  for  P. 

Proof:  Again  we  reduce  MCV  to  CONG.  The  wires  of  the  circuit  diagram  correspond  to  pairs 
of  nodes,  such  that,  the  two  nodes  are  related  in  ©  (the  congruence  closure)  iff  the  value  on  the 
wire  is  1.  Given  a  monotone  circuit  a  we  construct  a  dag  G(a),  an  equivalence  relation  R(a)  on  its 
nodes,  and  two  roots  of  the  dag  u(cr),  v(a).  We  wish  to  test  the  two  roots  for  equivalence  in  ©,  the 
congruence  closure  of  R(a). 

The  construction  is  bottom*up,  so  that  each  circuit  input  corresponds  to  a  pair  of  leaves  and 
each  OR  and  AND  gate  to  a  subgraph  with  two  pairs  of  input  and  two  pairs  of  output  nodes  each; 
the  last  OR-subgraph  has  only  one  pair  of  output  nodes  u(a).  v(a).  The  inputs  and  outputs  are 
connected  in  a  pattern  similar  to  that  of  the  proof  of  Theorem  1.  The  input  leaves  are  represented 
in  Figure  6a,  note  that  for  inputs  that  arc  1  the  two  leaves  arc  in  the  same  equivalence  class  of 
R(n).  The  OR-su^graph  is  in  Figure  6b,  and  two  pairs  of  internal  nodes  form  equivalence  classes  of 
R(a).  The  ANI>subgraph  is  in  Figure  Sc.  It  is  simple  to  verify  that  the  gate  subgraphs  simulate  the 
gate  logic  and  that  u (a)©v(o)  iff  the  output  of  the  circuit  a  is  1.  □ 


5.  A  Parallel  Algorithm  for  linn  Matching 


Unification  is  a  practical  sequential  algorithm  for  matching  since  unification  can  he  done  in 
linear  time.  However,  unification  is  not  a  good  parallel  approach  to  matching.  We  show  how 
MATCII(G.u.v)  can  be  computed  in  log^n  parallel  time  using  polynomial!)  many  processors.  In 
addition,  we  prove  that  MATCH(G.u.v)  is  in  co-NIOGSPACE.  If  G  is  a  simple  dag  then 
MAICH(G.u.v)  is  actually  in  DLOGSPACE. 

When  we  wish  to  determine  whether  s  matches  t.  we  will  assume  w.I.o.g.  that  no  variables 
appear  in  t.  In  Section  6  we  further  clarify  the  relationship  between  matching  and  unification.  Since 
MATCH(G.u.v)  is  the  same  as  UNlFY(G.u.v)  when  no  variables  appear  in  ty.  we  know  that  tu 
matches  ty  iff  there  is  a  homogeneous  c-c  relation  -  on  G  with  u-v.  A  refinement  of  this 
characterization  of  term  matching  suggests  an  efficient  parallel  algorithm. 

tamma  1:  Let  G  be  a  labeled  dag  with  nodes  u  and  v.  and  let  the  subgraph  of  G  induced  by 
the  descendants  of  v  have  no  nodes  labeled  with  variables.  Let  R  be  the  minimal  correspondence 
relation  on  G  with  uRv,  S  be  the  minimal  equivalence  relation  containing  R.  and  T  be  the  minimal 
correspondence  relation  containing  S.  Then  ty  matches  ty  iff  T  is  homogeneous. 

Proof:  If  ^  matches  ty  then  since  ty  and  ty  arc  unifiablc,  the  minimal  c-e  relation  ~  with  u~v 
is  homogeneous.  Since  ~  must  contain  T,  it  follows  that  T  is  homogeneous. 

For  the  converse,  suppose  that  T  is  homogeneous.  We  will  define  a  substitution  a  such  that 
<r(ty)  =  ty.  Let  Gy.Gy  be  the  subgraphs  of  descendants  of  u.v  respectively.  We  first  show  that  for 
every  node  x  in  Gu  there  is  a  node  y  in  Gv  such  that  xRy.  If,  on  the  contrary,  there  is  some  x  in 
Gu  without  xRy  for  any  y  in  Gy,  then  let  w  be  the  last  node  in  some  path  from  u  to  x  with  wRz 
for  some  z  in  Gv.  Since  w  has  a  son,  labcl(w)  is  a  k*ary  function  symbol  for  some  k>0.  By  similar 
reasoning,  Iabel(z)  is  a  zero-ary  function  symbol.  But  then  labcl(w)*label(z)  and  hence  T  is  not 
homogeneous.  It  follows  from  this  contradiction  that  every  S-equivalcnce  class  contains  at  least  one 
node  from  Gv. 

For  each  S-equivalcnce  class  E,  pick  some  node  e  from  Gy.  If  w  is  another  Gv  node  in  E,  then 
since  T  is  homogeneous  and  no  variables  appear  in  Gy,  we  can  argue  that  =  tg  (here  we  have 
the  problem  EQUAL).  We  now  define  the  substitution  a.  For  any  variable  x  in  ty,  let  E  be  the  S- 
equivalcnce  class  of  the  node  labeled  x  and  define  <r(x)  =  te.  It  is  easy  to  check  by  induction  on 
the  height  of  a  node  w  in  Gu  that  if  wRz,  then  o^)  =  tj.  Thus  <r(ty)  =  ty  and  ty  matches  ty.D 

Given  any  relation,  we  can  find  the  minimal  correspondence  relation  R  containing  it,  in  log2n 
parallel  time  and  n^1)  processors  on  a  PRAM,  using  a  transitive  closure  algorithm  [Ch],  If  G  is  a 
labeled  dag  with  n  nodes,  we  define  an  n^  by  boolean  correspondence  matrix  C q.  We  associate 
each  (unordered)  pair  of  nodes  of  G  with  a  row  and  a  column  of  Cq  and  define  the  entries  of  Cq: 

Cq({u,  v),  {x,  y})  si  iff  x  and  y  are  u  and  v  or  corresponding  sons  of  u  and  v. 

Lemma  2:  Let  G  be  a  labeled  dag  with  nodes  u  and  v,  and  let  R  be  the  min.  correspondence 
relation  sx  uRv.  Then  xRy  iff  the  ({u,  v),  {x,  y})  entry  of  Cq’s  transitive  closure  equals  1.  □ 


Now  given  relation  K,  we  can  lind  (lie  minimal  equivalence  relation  S  containing  K  using  a 
connected  components  algorithm.  It  is  well-known  that  connected  components  can  be  computed  in 
log^n  parallel  time  and  n®^  processors  on  a  PRAM  jCHSJ. 

Since  computing  correspondence  relations  twice,  connected  components  once  and  testing  for 
homogeneity  arc  sufficient  to  decide  matching,  we  have  that  MATCH(G.u.v)  can  be  computed  in 
log^n  parallel  time  and  n™^  processors  on  a  PRAM  (or  equivalently  MATCH  €  NC). 

In  fact,  we  can  show  somewhat  tighter  complexity  upper  bounds,  since  Dl.OGSPACK  C 
NI.OGSPACE  Q  NC: 

Theorem  4:  The  set  of  <G,u.v>  such  that  MATCH(G.u.v)  =  false  is  in  NI.OGSPACH. 
Furthermore,  if  G  is  a  simple  dag.  then  this  recognition  problem  is  in  DLOGSPACE. 

Proof:  Let  G  be  a  dag  with  MATCH(G.u.v)=y??/jr.  L.ct  R.S.T  be  relations  on  the  nodes  of  G 
as  in  the  statement  of  Lemma  1.  By  Lemma  1.  there  must  be  nodes  x  and  y  of  G  such  that  xTy. 
but  labcl(x)  and  labcl(y)  arc  two  different  function  symbols.  We  show  that  there  is  a  log-space 
bounded  nondctcrministic  luring  machine  M-p  capable  of  guessing  all  pairs  (x.y)  such  that  xTy, 
and  checking  whether  x  and  y  have  the  same  labels.  Thus  recognizing  the  <G,u,v>’s,  such  that, 
MATCH(G,u.v)  s  true  is  a  problem  in  co*NLOGSPACE  (also  a  subset  of  the  class  NC). 

To  begin  with,  let  Mr  be  a  nondcterministic  machine  that  starts  with  the  pair  (u,v)  on  its 
worktape.  A  move  of  Mr  consists  of  replacing  a  pair  (x,y)  with  a  pair  (Xj.yj)  of  corresponding  sons 
of  x  and  y.  Clearly  Mr  is  capable  of  guessing  (x,y)  iff  xRy. 

We  now  define  a  nondetcrministic  machine  Mg  using  Mr.  The  machine  Mg  begins  by  running 
Mr  some  nondetcrministic  number  of  steps  to  guess  a  pair  (x.y).  Subsequently.  Mg  repeats  the 
following  3  steps  nondetcrministically: 

(1)  If  one  pair  (x,y>  or  two  pairs  (x,y)  (w,z)  are  on  the  worktape,  then  it  may  replace  (x.y)  by  (y.x). 

(2)  If  (x,y),  (yj)  are  on  the  worktape,  then  it  may  replace  both  by  the  single  pair  (x,z). 

(3)  If  the  single  pair  (x.y)  is  on  the  worktape,  then  it  may  run  Mr  some  number  of  steps  to  guess 
(w,z)  and  end  up  with  both  pairs  (w,z),  (x,y)  on  the  worktape. 

With  these  primitive  steps  Mg  may  guess  (x,y)  iff  xSy. 

Finally,  we  build  Mj  from  Mg.  This  machine  behaves  just  like  Mr.  but  instead  of  starting  with 
(u,v),  starts  with  any  pair  (x,y)  that  Mg  is  capable  of  guessing.  This  concludes  the  proof  of  the  first 
part  of  the  theorem,  which  in  a  way  describes  the  PRAM  algorithm  sketched  above,  but  from  the 
point  of  view  of  nondeteiministk  log-space. 

If  G  is  a  simple  dag,  then  Mr  can  easily  be  made  a  deterministic  depth-first  enumerator  of 
pairs  (x,y).  This  machine  MDr  always  maintains  the  pair  immediately  proceeding  the  current  one, 
so  thit  it  can  backtrack  from  leaf  nodes.  Backtracking  from  internal  nodes  is  staightforward  since 
each  has  indegrec  1. 

Using  a  log-space  preprocessor  we  can  treat  the  subgraph  rooted  at  v  as  a  tree.  Recall  that  this 
graph  has  no  variables,  so  that  all  we  need  to  do  is  duplicate  leaves  labeled  wi(h  constants.  By  doing 
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this  wc  limit  the  number  or  times  step  (2)  of  Ms  must  be  repented  to  only  two.  Ihus  we  can 
construct  a  deterministic  machine  Mi)§  that  enumerates  all  (x.y)  such  that  xSy.  Finally,  wc  build  a 
deterministic  Ml>p  from  MI)r  and  Ml)§  as  before.  □ 

A  corollary  of  I'hcorcm  4  is  that  for  simple  dags  deciding  whether  MATCH(G.u,v)=/rur  is  also 
in  DLOGSPACK,  since  Dl.OGSPACK  is  closed  under  complement.  From  the  analysis  in  [FWJ  it 
also  follows  that  this  problem  can  be  solved  in  0(log  n)  parallel  time  on  a  PRAM. 


6.  Conclusions  and  Open  Problems 

Wc  have  demonstrated  that  several  versions  of  unification  are  complete  for  P.  This  suggests,  by 
way  of  the  parallel  computation  thesis,  that  unification  is  inherently  sequential.  It  is  unlikely  that 
significant  improvements  in  the  speed  of  theorem  proven,  interpreters  for  logic  programs,  and  the 
like  will  be  brought  about  by  the  development  of  parallel  unification  algorithms.  However,  for  the 
special  case  of  term  matching,  the  prospects  arc  much  brighter.  Term  matching  can  be  accomplished 
in  log  n  or  log^n  parallel  time,  depending  on  whether  the  input  is  in  the  form  of  a  simple  dag. 

We  might  also  point  out  that  unification  of  terms  s  and  t  is  complete  for  P  even  if  s  and  t  do 
not  contain  any  variables  in  common  (this  is  different  from  t  having  no  variables).  Also,  if  s  and  t 
are  unifiable  this  does  not  imply  that  s  matches  t  or  that  t  matches  s.  However,  if  s  matches  t  then  s 
and  t  are  unrestricted  unifiable.  If  s  matches  t  and  t  matches  s  they  are  unifiable. 

Intuitively,  congruence  closure  appears  to  be  a  "dual”  of  unification.  It,  too,  is  complete  for  P. 
As  a  consequence,  various  congruence  closure  problems,  such  as  the  decision  problem  for  the  first- 
order  quantifier-free  theory  of  equality  (DST]  are  not  conducive  to  extremely  hut  parallel  solutions. 

There  are  remarkable  similarities  between  the  sequential  algorithms  for  unification  and  testing 
equivalence  of  deterministic  finite  automata.  However,  the  inequivalence  of  deterministic  finite 
automate  can  be  detected  nondeterministically  using  only  logarithmic  space.  A  machine  can  see  that 
two  automata  A}  and  A2  are  equivalent  by  guessing  an  input  string,  character  by  character  and 
simulating  the  actions  of  both  machines  as  it  goes.  If  one  ends  up  in  an  accept  state  while  the  other 
rejects,  then  the  two  are  clearly  different.  If  Aj  and  A2  differ,  then  some  sequence  of  characters 
must  surely  uncover  this.  Thus  unification  is  subtly,  but  fundamentally  different  from  this  "almost 
identical"  problem. 

Some  interesting  open  problems  remain  unresolved,  namely;  (1)  lower  bounds  fix  die 
complexity  of  MATCH  and  EQUAL,  or  can  our  upper  bounds  be  improved,  (2)  the  number  of 
processors  used  in  the  transitive  closure  of  a  correspondence  matrix  is  unrealistically  large,  and  H 
would  be  of  seme  practical  significance  to  decrease  it  to  even  n*,  and  finally  (3)  what  is  the 
complexity  of  commutative  matching,  i.e„  if  function  symbols  stand  for  commutative  operations. 
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1.  A®B 

Z  C®D,  C®E,  H®F,  L®G  (propagation) 

3.  D®E  (transitivity) 

4.  H®I  (propagation) 

5.  F®I  (transitivity) 

6.  L®J  (propagation) 

7.  J®G  (transitivity) 

8.  M®K  (propagation) 

9.  ununifiable  because  M  and  K  have  distinct  labels  gj  and  g2- 


Figure  2:  illustrating  naive-unification 


(a)  OR  subgraph 


o5rti  out2.  on,.  •0uT4j 

(b)  AND  subgraph 


(c)  example  use  of  PATCH 


(d)  putting  everything  together 


Figure  5:  Theorem  2  subgraphs 
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